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Abstract 

We consider a one dimensional ballistic random walk evolving in an 
i.i.d. parametric random environment. We provide a maximum likelihood 
estimation procedure of the parameters based on a single observation of 
the path till the time it reaches a distant site, and prove that the estima- 
tor is consistent as the distant site tends to infinity. We also explore the 
numerical performances of our estimation procedure. 

Key words : Ballistic regime, maximum likelihood estimation, random walk in 
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1 Introduction 

Random walks in random environments (RWRE) have attracted much attention 
lately, mostly in the physics and probability theory literature. These processes 



were introduced originally by lChernovl 819671) to model the replication of a DNA 



sequence. The idea underlying Chernov's model is that the protein that moves 
along the DNA strand during replication performs a random walk whose tran- 
sition probabilities depend on the sequence letters, thus modeled as a random 
environment. Since then, RWRE have been developed far beyond this original 
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motivation, resulting into a wealth of fine prob abilistic results. Some recent sur- 
veys on the subject include Hughes! 19961) and Zeitouni 2004 ) . 



Recently, these models have regained interest from biophysics, as they fit the de- 
scription of some physical experiments that unzip the double strand of a DNA 
molecule. More precisely, some fifteen years ago, the first experiments on un- 
zipping a D NA sequence have been co nducted, relying on several different tech- 
niques f see Baldazzi et al. , 20061. 2007 . and the references therein). By that time, 
these experiments primarily took place in the quest for alternative (cheaper.faster 
or both) sequencing methods. When conducted in the presence of bounding 
proteins, such experiments also enabled the iden tification of spec ific locations 
at which proteins and enzymes bind to the DNA IKoch et alJ.l2002» . Nowadays, 
similar experiments are conducted in ord er to investigate molecular free energy 
landscapes with unprecedented accuracy Alemany et al. , 20121; Hu guet et al. , 20091) . 
Among other biophysical app lications, one can m ention the study of the forma- 
tion of DNA or RNA hairpins IBizarro et al.l.l2Q12T) . 



Despite the emergence of data that is naturally modeled by RWRE, it appears 
that very few statistical issues on tho se processes have been studied so far. Very 
recently, lAndreoletti and Diell 120120 considered a prob l em inspired by an ex 



perim ent on DNA unzipping Baldazzi et al. , 20061. 2007 : Cocco and Monasson 



2008), where the aim is to predict the sequence of bases relying on the obser- 
vation of several unzipping of one finite length DNA sequence. Up to some ap- 
proximations, the problem boils down to considering independent and identi- 
cally distributed (i.i.d.) replicates of a one dimensional nearest neighbour path 
[i.e. the walk has +1 increments) in the same finite and two -sites dependent 
environment, up to the time each path reaches some value M (the sequence 
length). In this setup, the authors consider both a discrete time and a continu- 
ous time model. They provide estimates of the values of the environment at each 
site, which corresponds to estimating the sequence letters of the DNA molecule. 
Moreover, they obtain explicit formula for the probability to be wrong for a given 
estimator, thus evaluating the quality of the prediction. 

In the present work, we study a different problem, also motivated by some DNA 
unzipping experiments: relying on an arbitrary long trajectory of a transient 
one -dimensional nearest neighbour path, we would like to estimate the param- 
eters of the environment's distribution. Our motivation comes more precisely 
from the most recent experiments, that aim at characterising free binding ener- 
gies between base pairs reivin g on the unzipping of a synthetic DNA sequence 
Ribezzi-Crivellari et alJ.l201ll) . In this setup, the environment is still considered 
as random as those free energies are unknown and need to be estimated. While 
our asymptotic setup is still far from corresponding to the reality of those exper- 
iments, our work might give some insights on statistical properties of estimates 
of those binding free energies. 

The parametric estimation of the environment distribution has already been 
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studied in lAdelman and Enriquea (200J). In their work, the authors consider 
a very general RWRE and provide equations relying the distribution of some 
statistics of the trajectory to some moments of the environment distribution. 
In the specific case of a one -dimensional nearest neighbour path, those equa- 
tions give moment estimators for the environment distribution parameters. It is 
worth mentioning that due to its great generality, the method is hard to under- 
stand at first, but it takes a simpler form when one considers the specific case 
of a one -dimensional nearest-neighbour path. Now, the method has two main 
drawbacks: first, it is not generic in the sense that it has to be designed differ- 
ently for each parametric setup that is considered. Namely, the method relies 
on the choice of a one-to-one mapping between the parameters and some mo- 
ments. Note that the injectivity of such a mapping might even not be simple 
to establish (see for instance the case of Example|Il]below, further developed in 
Section [5711 . Second, from a statistical point of view, it is clear that some map- 
pings will give better results than others. Thus the specific choice of a mapping 
has an impact on the estimator's performances. 

As an alternative, we propose here to consider maximum likelihood estimation 
of the parameters of the environment distribution. We consider a transient near- 
est neighbour path in a random environment, for which we are able to define 
some criterion - that we call a log-likelihood of the observed process, see {8} be- 
low. Our estimator is then defined as the maximiser of this criterion - thus a 
maximum likelihood estimator. When properly normalised, we prove that this 
criterion is convergent as the size of the path increases to infinity. This part of 
our work relies on using the link between RWRE and branching processes in 
random environments (BPRE). While this link is already well-known in the liter- 
ature, we provide an explicit characterisation of the limiting distribution of the 
BPRE that corresponds to our RWRE (see theorem 14.51 below). Relying on this 
precise characterisation, we then further prove that the limit of our normalised 
criterion is finite in what is called the ballistic region, namely the set of param- 
eters such that the path has a linear increase (see Section 12.11 below for more 
details). Then, following standard statistical results, we are able to establish the 
consistency of our estimator. We also provide synt hetic experiments to com pare 
the effective performances of our estimator and lAdelman and Enriquea s pro- 
cedure. In the cases where Adelman and Enrique j 's estimator is easily settled, 
while the two methods exhibit the same performances with respect to their bias, 
our estimator exhibits a much smaller variance. We mention that establishing 
asymptotic normality of this estimator requires much more technicalities and is 
out of the scope of the present work. This point will be studied in a companion 
article, together with variance estimates and confidence intervals. 

The article is organised as follows. Section lZTl introduces our setup: the one di- 
mensional nearest neighbour path, and recalls some well-known results about 
the behaviour of those processes. Then in Section lZ2l we present the construc- 
tion of our M-estimator {i.e. an estimator maximising some criterion function), 
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state the assumptions required on the model as well as our consistency result 
(Section [273T l . Section [3] presents some examples of environment distributions 
for which the model assumptions are satisfied so that our estimator is consis- 
tent. Now, the proof of our consistency result is presented in Section |H The 
section starts by recalling the link between RWRE and BPRE (Section l4~ll . Then, 
we state our core result: the explicit characterisation of the limiting distribution 
of the branching process that is linked with our path; and its corollary: the ex- 
istence of a (possibly infinite) limit for the normalised criterion (Section|4j2}. In 
Section 1431 we first provide a technical result on the uniformity of this conver- 
gence, then establish that in the ballistic case, the limit of the normalised crite- 
rion is finite. An almost converse statement is also given (Lemma[4j9). To con- 
clude this part, we prove in Section l4~4l that the limiting criterion identifies the 
true parameter value (under a natural identifiability assumption on the model 
parameter). Finally, numerical experiments are presented in Section l52l focus- 
ing on the three examples that were develo ped in Sectional Note th at we also 
provide an explicit description of the form o flAdelman and Enriquezr s estimator 
in the particular case of the one- dimensional nearest neighbour path in Sec- 
tionl57Ll 



2 Definitions, assumptions and results 
2. 1 Random walk in random environment 

Let a) = {a) x }x£Z be an independent and identically distributed (i.i.d.) collection 
of (0, 1) -valued random variables with distribution v. The process a) represents 
a random environment in which the random walk will evolve. We suppose that 
the law v = vg depends on some unknown parameter 6 e @, where c i d is 
assumed to be a compact set. Denote by P e = v^ z the law on (0, l) z of the envi- 
ronment {tojJxez and by E e the expectation under this law. 

For fixed environment a), let X = {X f } tE M be the Markov chain on Z starting at 
Xo = and with transition probabilities 

{a> x ify = x + l, 
l-(o x ify = x-l, 
otherwise. 

The symbol P w denotes the measure on the path space of X given a), usually 
called quenched law. The (unconditional) law of X is given by 

P 9 (-)= f P w (-)dP 9 M, 

this is the so-called annealed law. We write E m and E e for the corresponding 
quenched and annealed expectations, respectively. We start to recall some well- 
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known asymptotic results. Introduce a family of i.i.d. random variables, 



l-co> 



CD, 



xeZ, 



(1) 



and assume that log po is integrable. Solomon 19751) proved the following clas- 
sification: 



(a) ifE y (logp ) <0, then 

lim X t = +00, P -almost surely. 

t— >oo 



(b) IfE y (logp ) =0,then 

n 

-00 = liminf X t < lim sup X t = +00, P -almost surely. 



The case of E (logp ) > follows from (a) by changing the sign of X. Note that 
the walk X is P e -almost surely transient in case (a) and recurrent in case (b). 

In the present paper, we restrict to the case (a) when X is transient to the right. 
Then, it was also found that the rate of its increase (with respect to time t) is 
either linear or slower than linear. The first case is called ballistic case and the 
second one sub-ballistic case. More precisely, letting T n be the first hitting time 
of the positive integer n, 

T n =inf{f eN : X t = n}, (2) 
and assuming E 9 (log po) < all through, we have 

(al) if E e (p ) < 1, then, P e -almost surely, 

Tn > l + E 8 (Po) 

n n^oo i-E e (p )' 



(a2) If E (p ) > 1, then T n /n^ +00 P -almost surely, when n tends to infinity. 



2.2 Construction of a M-estimator 

We address the following statistical problem: estimate the unknown parameter 
from a single observation of the RWRE path till the time it reaches a distant site. 
Assuming transience to the right, we then observe X[o,r„] = {X t : t = 0, 1, . . ., T n }, 
for some n>l. 
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If X[o,r] := {XQ,...,x t ) is a nearest neighbour path of length t, we define for all 

xeZ, 

t-i 

L(x,x [0 , f ]) := £ Ux s = x; x s+1 = x- 1}, (4) 

s=0 

t-i 

and R{x,x [0it] ):=Y J ' i -{ x s = x;x s+l = x + l}, (5) 

s=0 

the number of left steps (resp. right steps) from site x. (Here, 1{-} denotes the 
indicator function). We let also v t (resp. Vj n ) be the set of integers visited by the 
pathx[ 0) r] (resp. X[o,r„])- Consider now a nearest neighbour path X[o,f B ] starting 
from and first hitting site n at time t n . It is straightforward to compute its 
quenched and annealed probabilities, respectively 

^coi^-[0,T„] -X[0,f„]J - 11 0i x (l-(O x ) 

xev tn 

and 

P e (X [0 , t „] =x [0 , f „])= n f « fl(x ' Xia ' nl) (l-fl) i(x ' X|0 ' f " l) dv e (a). 
xev, n J0 

Under the following assumption, these weights add up to 1 over all possible 
choices of X[o,t n ]- 

Assumption I. (Transience to the right). For any 9 e 0, E e | log p 1 < oo and 

E 9 (logp )<0. 

Introducing the short-hand notation 

L x :=Ux,X [0>Tn] ) and R n x := R{x,X [QiTn] ), 
we can express the (annealed) log-likelihood of the observations as 

hid) = E lQ g f * " a) L "dveta) + £ log f 1 a fl "(l - a) L "dv e {a). (6) 

x=0 JO x<0,xeV Tn Jo 

Note that as the random walk X starts from (namely Xo = 0) and is observed 
until the first hitting time T n of n > 1, we have R x = L" +l + 1 for x = 1,2, . .., n - 
1. We will perform this change in the first line of the right-hand side of ©. 
Also, since the walk is transient to the right (Assumption [I}, the second sum in 
the right-hand side (accounting for negative sites x) is almost surely bounded. 
Hence, this sum will not influence in a significant way the behaviour of the nor- 
malised log-likelihood, and we will drop it. Therefore, we are led to the following 
choice. 
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Definition 2.1. Letcpo be the function from N 2 to U given by 

e (x,y)=log f a x+1 Cl-fl) y dv fl (fl). (7) 
Jo 

The criterion function >-> £„ (0) is defined as 

£„m = n t(f>0{L n x+v L^, (8) 
that is the first sum (dominant term) in [Bp. 

We maximise this criterion function to obtain an estimator of the unknown pa- 
rameter. To prove convergence of the estimator, some assumptions are further 
required. 

Assumption II. (Ballistic case). For any e 0, E 9 (po) < 1. 

As already mentioned, Assumption Q] is equivalent to the transience of the walk 
to the right, and together with Assumption^ it implies positive speed. 

Assumption III. ( Continuity). For any x, y e N, the map >—■ cf>Q (x, y) is continu- 
ous on©. 

Assumption |TTT] is equivalent to the map >—■ vq being continuous on with 
respect to the weak topology. 

Assumption IV. (Identifiability). V(0,0') £ 2 , v e ± v B , <=> ^ 0'. 
Assumption V. The collection of probability measures {vg : e 0} is such that 

infE e [log(l-a»o)] > -oo. 

0E0 

Note that under Assumption [II] we have E e [loga»o] > -log2 for any e 0. As- 
sumptions [HI] and |V| are technical and involved in the proof of the consistency 
of our estimator. Assumption [IV] states identifiability of the parameter with 
respect to the environment distribution vq and is necessary for estimation. 

According to Assumption [TTTl the function >-» £ n {9) is continuous on the com- 
pact parameter set 0. Thus, it achieves its maximum, and we define the estima- 
tor 6 n as a maximiser. 

Definition 2.2. An estimator „ of 6 is defined as a measurable choice 

0„eArgmax^(0). (9) 

Note that 6„ is not necessarily unique. 

Remark 2.3. The estimator 8 n is a M -estimator, that is, the maximiser of some 
criterion function of the observations. The criterion £ n is not exactly the log- 
likelihood for we neglected the contribution of the negative sites. However, with 
some abuse of notation, we call 9 n a maximum likelihood estimator. 
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2.3 Asymptotic consistency of the estimator in the ballistic case 

From now on, we assume that the process X is generated under the true param- 
eter value 6* , an interior point of the parameter space 0, that we want to esti- 
mate. We shorten to P* and E* (resp. P* and E*) the annealed (resp. quenched) 
probability P 9 (resp. P e ) and corresponding expectation E 9 (resp. E e ) under 
parameter value 6* . 

Theorem 2.4. (Consistency). Under AssumptionsQto[$ for any choice of 6 n sat- 
isfying (9), we have 

lim e n = e*, 

n—>oo 

in P* -probability. 



3 Examples 

3. 1 Environment with finite and known support 

Example I. Fix a\ < az e (0, 1) and letv = p8 ai + (1 - p)8 a2 , where 5 a is the Dirac 
mass located at a. Here, the unknown parameter is the proportion p e c [0, 1] 
(namely 9 = p). We suppose thata\, az and® are such that Assumptions\^and^ 
are satisfied. 

This example is easily generalised to v having m > 2 support points namely v = 
i Pi^cii! where a\, a m are distinct, fixed and known in (0, 1), we let p m = 
^-YJJXi Pi and the parameter is nowfl = (pi,...,p TO -i)- 
In the framework of Example^ we have 

(f> p {x,y) = log[p< +1 (l - a x y + (1 - p)a% +1 {l - a 2 ) y ], (10) 

and 

r j" 4-1 I" t" 4.1 r»l 

(ID 



£ n {p):=£ n {6)= £>g pay" +1+ \l-a 1 ) L " + a-p)4" +1+ \l-a 2 ) L " 



x=0 

Now, it is easily seen that Assumptions HTT1 to fVl are satisfied. Coupling this point 
with the concavity of the function p^ £ n (p) implies that p n = Argmax pee £ n {p) 
is well-defined and unique (as is a compact set). There is no analytical expres- 
sion for the value of p n . Nonetheless, this estimator may be easily computed by 
numerical methods. Finally, it is consistent from Theorem l2.4l 



3.2 Environment with two unknown support points 

Example II. We let v = p8 ai + (1 - p)5 a2 and now the unknown parameter is 
6 - {p, a\, az) £ 0, where© is a compact subset of 

(0, 1) x {{ai,a 2 ) e (0, l) 2 : a x < a 2 }. 
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We suppose that® is such that Assumptions\^and^are satisfied. 



This case is particularly interesting as it corresponds to one of the setups in 
the DNA unzipping experiments, namely estimating binding energies with two 
types of interactions: weak or strong. 

The function cpg and the criterion £ n {-) are given by (TO) and QTJ, respectively 
It is easily seen that Assumptions [TlTI to N\ are satisfied in this setup, so that the 
estimator 8 n is well-defined. Once again, there is no analytical expression for 
the value of 6 n . Nonetheless, this estimator may also be easily computed by 
numerical methods. Thanks to Theorem l2.4l it is consistent. 

3.3 Environment with Beta distribution 

Example HI. We letv be a Beta distribution with parameters (a, p), namely 

1 f 1 

dv(a) = -/^(l-fl/'Mfl, B(a,fi)= \ t a ~ l {l-tf- l dt. 

B(a,j8) Jo 

Here, the unknown parameter is 8 = (a, /3) e where is a compact subset of 

{(a,/3) e (0,+oo) 2 : a>/3 + l}. 

As E e (po) = j6/(a - 1), the constraint a > (3 + 1 ensures that Assumptions^ andUL 
are satisfied. 

In the framework of Example iHll we have 

B(x + 1 + a,y + B) 



and 



n-l 

(,M = -n\o%B{a,p) + X logB(L^ +1 + a + 1, L n x + p) 

x=0 

_"" 1 io {L n x+1 + a) [L n x+l + a - 1) . . . a x (L n x + jg - 1) {L n x +p -2) . . . g 
" x to° 8 (L^ +1 +L2+a+^-l)(i" + i+i2+«+)6-2)...(a + ) 6) 

In this case, it is easily seen that Assumptions|Ill]to|V|are satisfied, ensuring that 
6 n is well-defined. Moreover, thanks to Theorem l2.41 it is consistent. 



4 Consistency 

The proof of Theorem 12.41 relies on classical theory about the convergence of 
maximum likelihood estimators, as stated for instance in the classical approach 
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b vlWaldl h949h for i.i.d. random variables. We refer for instance to Theorem 5.14 



m 



van der Vaartl 119980 for a simple presentation of Wald's approach and further 
stress that the proof is valid on a compact parameter space only. It relies on the 
two following ingredients. 

Theorem 4.1. Under Assumptions^ fo[B there exists a finite deterministic limit 
£{6) such that 

-£ n {6) >£{6) in P* -probability, 

and this convergence is "locally uniform" with respect to 0. 

The sense of the local uniform convergence is specified in Lemma l4"77l in Subsec- 
tion !4.3l and the value of £{6) is given in Q7}. 

Proposition 4.2. Under Assumptions^ £o[0 for any £ > 0, 

sup £{6) < £(0*). 

0:||0-0*||>e 

Theorem l4. 1 l induces a pointwise convergence of the normalised criterion £ n ln 
to some limiting function £, and is weaker than assuming uniform convergence. 
Proposition l4.2l states that the former limiting function £ identifies the true value 
of the parameter 6* , as the unique point where it attains its maximum. 

Here is the outline of the current section. In Subsection 14.11 we recall some 
preliminary results linking RWRE with branching processes in random environ- 
ment (BPRE). In Subsection 14.21 we define the limiting function £ involved in 
Theorem l4 . 1 1 thanks to a law of large numbers (LLN) for Markov chains. In Sub- 
sections |4j3]and|421 we prove Theorem 14. II and Proposition 14.21 respectively. It 
is important to note that the limiting function £ exists as soon as the walk is 
transient. However, it is finite in the ballistic case and everywhere infinite in the 
sub-ballistic regime of uniformly elliptic walks, see Lemma l4~9l This latter fact 
prevents the identification result stated in Proposition 14.21 and explains why we 
obtain consistency only in the ballistic regime. From all these ingredients, the 
consistency of 6 n , that is, the proof of Theorem l2.4l easily follows. 



4. 1 From RWRE to branching processes 



We start by recalling some already known results linking RWRE with branch- 
ing process es in random envir onment (BPRE). Indeed, it has been previously 
observed in iKesten et alJ 119751) that for fixed environment a) = {a) x } xf£ z, under 
quenched distribution P^, the sequence L",L"_ 1 ,...,Lq of the number of left 
steps performed by the process X[o,t„] from sites n, n — 1, .. ., 0, has the same dis- 
tribution as the first n generations of an inhomogeneous branching process with 
one immigrant at each generation and with geometric offspring. 
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More precisely, for any fixed value n e N* and fixed environment a), consider a 
family of independent random variables {£ k j : k e {1, . .., n}, i e N} such that for 
each fixed value k e {1, . . . , n}, the {^/t.JieN are i.i.d. with a geometric distribution 
on N of parameter (i) n - k , namely 

Vm e N, Patfk.i = Tri) = {l-(o n - k ) m h) n - k . 

Then, let us consider the sequence of random variables {Zj?}fc = o„., w defined re- 
cursively by 

Z " = 0, andforfc = 0,...,/z-l, Z£ +1 = £f fc+u . 

!=0 

The sequence {Z?}jt=o,...,n forms an inhomogeneous BP with immigration (one 
immigrant per generation corresponding to the index i = in the above sum) 
and whose offspring law depends on n (hence the superscript n in notation Z£). 
Then, we obtain that 

fjn tU jU\ , 7 n 7 n 7)1-, 

^n'^n-V-'^O' ~ ^0 '^1 '■■■< z <n ) > 

where ~ means equality in distribution. When the environment is random as 
well, and since [(Do,...,a) n ) has the same distribution as (o»„,...,o»o), it follows 
that under the annealed law P*, the sequence L%,L*_ v ...,Lfi has the same dis- 
tribution as a branching process in random environment (BPRE) Z ,...,Z n , de- 
fined by 

z k 

Z = 0, and for k = 0,...,n, Z fc+1 = £ ? k+lii , (13) 

i=0 

with .}fceN*;ieN independent and 

VmeN, P a) {{' kJ = m) = {l-a> k ) m a) k . 

Now, when the environment is assumed to be i.i.d., this BPRE is under annealed 
law a homogeneous Markov chain. We explicitly state this result because it is 
important; however its proof is immediate and therefore omitted. 

Proposition 4.3. Suppose that {to n } nE ^ are i.i.d. with distribution vq. Then the 
sequence {Z n } ne ^ is a homogeneous Markov chain whose transition kernel Qq is 
given by 



Qe(x,y) = 



x + y\ 


e <M*,y) _ 


'x + y\ 


, x J 




, x I 



l a-a) y dv e {a). (14) 



Finally, going back to © and the definition (7) of <pg , the annealed log-likelihood 
satisfies the following equality 

n-i 

( n m ~ E <MZ fc ,Z fc+1 ) under P*. (15) 

k=0 
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Remark 4.4. Up to an additive constant (not depending on 6), the right-hand 
side of fT5t is exactly the log-likelihood of the Markov chain {Zjfc} <fc< n . Indeed, 
we have 



log Q (x,y) =log 



'x + y* 



+ (peix,y), Vx,yel 



We prove in the next section a weak law of large numbers for {(pei^k> ^k+\)) keN 
and according to (T5) , this is sufficient to obtain a weak convergence of £ n {&) I n. 



4.2 Existence of a limiting function 



It was shown by lKeyl (Theorem 3.3, Il987l) that under Assumption Q] (and for a 



non-necessarily i.i.d. environment), the sequence {Z„}„ e w converges in annea- 
led law to a limit random variable Z which is a lmost surely finite. An explicit 



construction of Z is given by Equation (2.2) in iRoitershteinl <2007l) . In fact, a 
complete stationary version {Z n } n<£ z of the sequence {Z n }„ e ^ is given and such a 
construction allows for an ergodic theorem. In the i .i.d. environment setup , we 



obtain more precise results than what is provided by lKeyl (Theorem 3.3. I1987I) . as 
{Z ra }neN is a Markov chain. Thus Theorem l4.5l below is specific to our setup: ge- 
ometric offspring distribution, one immigrant per generation and i.i.d. environ- 
ment. We specify the form of the limiting distribution of the sequence {Z„}„ £ n 
and characterise its first moment. We later rely on these results to establish a 
strong law of large numbers for the sequence {(f>g{Zi c ,Z] c+ ])}i CE .N- 

Theorem 4.5. Under Assumption^ for all 9 e© the following assertions hold 

i) The Markov chain {Z„}„ £ m is positive recurrent and admits a unique in- 
variant probability measure ng satisfying 

lim P e (Z„ = k) = n e {k), VfceN. 

n— »oo 

ii) Moreover, for all keN, we have ng {k) = E 6 [S(l - S) k ] , where 

S:= (1 + pi +pip 2 + --- + P\...pn + ---)~ l e (0,1). 

In particular, we have Y.kEN kng{k) = Ln>i(IE e po)"> and the distribution ng 
has a finite first order moment only in the ballistic case. 

Proof. We introduce the quenched probability generating function of the ran- 
dom variables £ ' . and Z n introduced in {T3j , respectively defined for any u e 
[0, 1] by 

tf a (»):=£ a ("H = ; ,!° n ; > and F n (u) := E M [u z ") , 
V I \~{\-(i)n)U 
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as well as the quantities S„ and S n denned as 

S^ 1 = l+Pn + PnPn-l + -" + Pn---Pl, 
S" 1 = l + p 1 +p 1 p 2 + --- + P\...pn- 

According to (131 , we have 

F n+ i{u) = F n [H n+1 {u)] x H n+1 {u), 

and by induction 

„ , , (Ol...(O n 



A n {(D) - B n {(o)u' 
where A n and B n satisfy the relations 

f A n+l {cu) = A n {cu) - B„{oj) x cu n+l , 
\ B n+ i{a)) = A n {cj) x (l-a) n+l ). 

A simple computation yields 

A n {LD) = CD l ...a) n S~ 1 and B n {oo) = (Oi ...ft^CS" 1 - 1). 

Finally, we have for any u e [0, 1] 



F„[u) 



i-a-s n )u 



This means that under quenched law P^, the random variable Z n follows a geo- 
metric distribution on N with parameter S n . Note that S n and S„ have the same 
distribution under P e , implying that F n [u) has the same distribution as 

Sn 



1-(1- S n )u 

Under Assumption!!] we have P e -a.s. 

1 1 n 

lim -log(pi...p„) = lim - Y logp, = E e logp := m < 0, 

and hence 

P e (3nM, s.t.\/n>n{(0), p l ...p n <e nm/2 ) = l. 

Then, as n — +oo, S„ \ S = {l+p 1 +p 1 p 2 +...)~ 1 P e -a.s. withP> 9 (0 < S < 1) = 1. As 
a consequence, the quenched probability generating function F n {u) converges 
in distribution under P to 

S 

F{u) = 



1-(1-S)u 



the probability generating function of a geometric distribution with parameter 
S. Under annealed law, for any k e N we have 



13 



V a {Z n = k) = E a P t) {Z n = k) = E° 



$n (1 Sri) 



— E 



Sn{l-Sn) 



Since < S n < 1, dominated convergence implies that for all k e I 



lim P u {Z n = k) = E > 



sa-sy 



:=n e {k). 



(16) 



As an immediate consequence, we obtain 

oo 

Y,k7i e {k) = E e (S- 1 -l)=Y,^ e Po) n > 

fceN n=l 

Moreover, by Fubini-Tonelli's theorem and P e (0 < S < 1) = 1, we have 
Y,ng{k) = l and 7i e {k)>0, VfceN. 

fceN 

Thus the measure ng on N is a probability measure and thanks to (16), it is in- 
variant. We note that {Z„}„ e m is irreducible as the transitions Qg{x, y) defined by 
(14} are positive and the measure vg is not dege nerate. Thus, the chain is pos- 
itive recurrent and ng is unique (see for instance iNorrisl. Il998l. Theorem 1.7.7). 
This concludes the proof. □ 

Let us define {Z n }neN as the stationary Markov chain with transition matrix Qg* 
defined by fi"4l and initial distribution n* := ng* introduced in Theorem l4.5l It 
will not be confused with {Z„}„ £ m from (13). We let £{0) be defined as 



m=E*[00(Z o ,Zi)]e[-oo,O], 



(17) 



where (pg is defined according to (7J . (Note that the quantity £[6) may not neces- 
sarily be finite). As a consequence of the irreducibility of the chain {Z„} n E M and 
Theo rem l4.5l we obtain the following ergodic theorem (see for instance 



Norris, 



1998, Theorem 1.10.2). 



Proposition 4.6. Under Assumption for all 9 e ©, the following ergodic theorem 
holds 



1 



n-l 



lim — 5^ 0e(Zfc,Zfc + i) = £{8) P* -almost surely. 



fc=0 



4.3 Local uniform convergence and finiteness of the limit 

According to (T5j and Proposition l4.6l we obtain 

lim -e n {d) =£{6) in P* -probability. (18) 

n->oo n 

To achieve the proof of Theorem 14. II it remains to prove that the convergence 
is "locally uniform" and that the limit £{8) is finite for any value of 8. The local 
uniform convergence is given by Lemma l477l below while Proposition l4.8l gives a 
sufficient condition for the latter fact to occur. 
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Lemma 4.7. Under Assumption^ the following local uniform convergence holds: 
for any open subset [/ c 0, 

1 " _1 / \ 

— V sup (p B {L" +l , L") ► E* sup (p e (Z , Zi) in P* -probability . 

Proof of Lemma WK Let us fix an open subset U c and note that 

n x=oeau n k=o 

where we have $[/ := sup 9e(7 (£g. As the function $>u is non-positive, the expec- 
tation E* ($[/(Zo, Zi)) exists and relying again on the ergodic theorem for Markov 
chains, we obtain the desired result. □ 

Proposition 4.8. (Ballistic case). As soon as 

E*(p )<l, (19) 
the limit ({0) is finite for any valued e 0. 

Proof of Proposition \4M For all x e N, y e N, by using Jensen's inequality, we may 
write 

log f 1 a x+ \l - a)ydve(a) > (x+ l)E 9 [log(w )] + yE e [logd - w )]. (20) 
Jo 

This implies that for any k e N, 

(f>g{Z k ,Z k+1 ) > (Zfc + l)E e [log(wo)] + Z t+1 E e [log(l - wo)}, 
and in particular 

- £ tf> g [Z k ,Z k+l ) > E e [log(w )]- E (Z k + 1) + E 9 [log(l - w )]- £ Z k+l . (21) 

n k=0 n k=0 n fc=0 

Now, as a consequence of Theorem l4.51 we know that in the ballistic case given 
by {19} the expectation E*(Z ) is finite. From the ergodic theorem, P*-almost 
surely, 

-J^(Z k + l) ► E*(Zb) + 1 and -£z fc+1 >E*(Z ). (22) 

Combining this convergence with the lower bound in CTH . we obtain £ [6] e 
(-oo, 0] in this case. □ 

The next lemma specifies that condition {T9} is necessary for ({6) to be finite at 
least in a particular case. 
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Lemma 4.9. ( Converse result in the uniformly elliptic case). Assume thatvg{[5, 1 - 
5]) = 1 for some 6 > and all 8 e© (uniformly elliptic walk). Then, in the sub- 
ballistic case, thatisE*{po) > 1, the limit £ [6) is infinite for all parameter values. 

Proof. For any integers x and y and any a in the support of vg, we have 
< 5 X+1 < a x+1 < (1 - <5) x+1 , < $y < (1 - a)y < (1 - fl* 

and then 

(x + y + l)log(<S) <log [ a x+l (l-a) y dv g {a) < (x + y + l)log(l -5). 
Jo 

This implies that for any e N, 

(Z fc + Z fc+1 + 1) log(<?) < (Z fc , Z fc+ i) < (Z fc + Z fc+1 + 1) log(l - 8), 
and in particular 

-£0 e (Z fc ,Z fc+1 )<log(l-5)-£(Z fc + Z fc+1 + l). (23) 

n k=0 n k=0 

Combining the convergence <22l with the above upper bound implies that as 
soon as £{6) > -oo, we get E*(Zo) < +oo and according to point Hi) in Theo- 
rem !4.5l this corresponds to the ballistic case [TjD ■ □ 

4.4 Identification of the true parameter value 

Fix £ > 0. We want to prove that under Assumptions[I]to|V| 

sup £{6) < £{6*). 
0:||0-0* U >e 

First of all, note that according to Proposition 14.81 Assumption |II] ensures that 
£{8) is finite for any value 8 e&. 

Now, we start by proving that for any 6 e 0, we have £[6) < £[&*). According 
to Q7}, we may write 

£{8) - £{6*) = E* [0 e (Z o , Zi) - (/>g* (Zo, Zi)], 

which may be rewritten as 

Using Jensen's inequality with respect to the logarithm function and the (condi- 
tional) distribution Qg* (x, •) yields 

£{d)-£(6*)< £>*(x)log(£ -^^-Q e *(x,y))=0. (24) 



±n lQe*(*,y)J 



16 



The equality in l l24t occurs if and only if for any x e N, we have Qg[x, ■) = Qg* [x, ■), 
which is equivalent to the probability measures vg and vq* having identical mo- 
ments. Since their supports are included in the bou nded set (0, 1), t hese proba- 



bility measures are then identical (see for instance IShiryaevl 119961. Chapter II, 
Paragraph 12, Theorem 7). Hence, the equality £{8) = £{8*) yields vg = vg* 
which is equivalent to 8 = 8* from Assumption^ 

In other words, we proved that £{8) < £[8*) with equality if and only if 8 = 6*. 
To conclude the proof of Proposition l4.2l it suffices to establish that the function 
8 1-* £{8) is continuous. 

From Inequality {20) and Assumption|V| we know that there exists a positive con- 
stant A such that for any 8 £&, 

Under Assumption HH we know that E*(Zo) = E*(Zi) is finite, and under As- 
sumption [nil the function 8 >—■ cf>g{x,y) is continuous for any pair (x, y). We 
deduce that the function >— £{8) is continuous. 



5 Numerical performances 



In this section, we explore the numerical performances of our estimation pro- 
cedure and compare them wi th the performances of the estimator proposed by 
Adelman and Enriquea <2004l) . As this latter procedure is rather involved and far 
more general than ours, we start by describing its form in our specific context in 
Section [5TTJ The simulation protocol as well as corresponding results are given 
in Section[521 where we focus on Examplesllltolim 



5. 1 Estimation procedure of Adelman and Enriquez 12004 ) 



The estimator proposed by lAdelman and Enriquea <2004l) is a moment estima- 
tor. It is based on collecting information on sites displaying some specified his- 
tories. We shortly explain it in our context: the one dimensional RWRE. 

Let H{t, x) denote the history of site x at time t defined as 
H{t,x) = {Ux,X l0i t]),R{x,X [0 ,t])), 

where L(x,X[o, f j) and R{x,X[o t t]) are respectively defined by S) and {5}, and rep- 
resent the number of left and right steps performed by the walk at site x until 
time t. Note that H{0, x) = (0, 0) for any site x. 

We define H[t) as the history of the currently occupied site X t at time t, that is 



mt)=mt,x t 
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For any h = [h-,h+) e N 2 , let {K^}i>o be the successive times where the history 
of the currently occupied site is h: 

Kg = M{t > : H{t) = h}, K? +1 = inf{?> Kf : H{t) = h}. 

Define A* 1 with values in {-1, 1} as 



A ! - X K h +l ~ X K h > 



which represents the move of the walk at time K^, that is, the move at the z'th 
time where the history of the currently occupied site is h. 

According to Proposition 4 and Corollary 2 in lAdelman and Enriquea 120041) . the 
random variables A* 1 are i.i.d. and we have 

Y tn 

E 1 {A h =i} = ^i( h ) P*-a.s., (25) 



m— oo m 



i=l 
j m 



and lim — V l, A n__ 11 = V-i(h) P*-a.s., (26) 

m^oo ffi r^i 1 i ~ ' 

where 

E* [a>l +h+ (1 - w ) fc -] E* (1 - Wo) 1+?! -] 

Vi(h) = -. and V"_i(h) = 



E*[wJ + (l-Wo) fc -] E*[^ + (l-w ) ? H 

The quantities Vi(h) and V_i(h) are the annealed right and left transition prob- 
abilities from the currently occupied site with history h. In particular, in our 
case Vi(h) + V_i(h) = 1. The consequence of the previous convergence result is 
that by letting the histories h vary, we can potentially recover all the moments 
of the distribution v an d thus this distribution itself. The strategy underlying 



Adelman and Enriquezf s approach is then to estimate some well-chosen mo- 



ments V\ (h) or VI i (h) so as to obtain a set of equations which has to be inverted 
to recover parameter estimates. 

We thus define Mjj and for e = ± 1 the estimators V" (h) as 

1 M " 

M* = sup{kf<T n :i>l}, V e n (h) = — E £l {A h =e) , e = ±1. 

The quantity V"{h) is either the proportion of sites from which the first move 
is to the right (e = 1) or to the left (e = -1), among those with history h. (In 
particular, V^fli) + V^{h) = 1.) Then, from (25) and (26j and the fact that T n 
goes to infinity P* -almost surely when n grows to infinity, we get 

lim V" (h) = V e (h) P* - almost surely. 

Hence, we can estimate 6* by the solution of an appropriate system of equa- 
tions, as illustrated below. 
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Example!] (continued). In this case the parameter 8 equals p and we have 
Vi(0,0) = E*[£Oo] = p*a 1 + (l-p*)a 2 . 

Hence, among the visited sites (namely sites with history h = (0,0)), the pro- 
portion of those from which the first move is to the right gives an estimator for 
p* a\ + (1 - p*)a 2 . Using this observation, we can estimate p*. 

Examplelnl (continued) . In this case the parameter 6 equals {p, a\, a-£) and we 
may for instance consider 

V^i (0,0) = p*a* + {l-p*)a*, 

V l (0, 1) = {p* [a*] 2 + (1 - p*) [a*] 2 } ■ Vi (0, 0)" 1 , (27) 
Vi (0,2) = {p*la*f + (1 - p*) la*] 3 } ■ V l (0, l)" 1 . 

Hence, among the visited sites (sites with history h = (0,0)), the proportion of 
those from which the first move is to the right gives an estimator for p* a* + (1 - 
p*)a*. Among the sites visited at least twice from which the first move is to the 
right (sites with history h = (0, 1)), the proportion ofthose from which the second 
move is also to the right gives an estimator for p*la*] 2 + (1 - p*)la*} 2 . Among 
the sites visited at least three times from which the first and second moves are 
to the right (sites with history h = (0,2)), the proportion of those from which 
the third move is also to the right gives an estimator for p* la*] 3 + (1 - p*) [a£ ] 3 . 
Using these three observations, we can theoretically estimate p* , a* and a*, 
as soon as the solution to this system of three nonlinear equations is unique. 
Note that inverting the mapping defined by ( f27T > is not trivial. Moreover, while 
the moment estimators might have small errors, inverting the mapping might 
result in an increase of this error for the parameter estimates. 



Example Bill (continued). In this case, the parameter 6 equals (a,/3) and we 
have 

P-i (0,0) = / - and VLi(l,0) = - 



a* + /3* a* + p* + l 

Hence, among the visited sites (sites with history h = (0,0)), the proportion of 
those from which the first move is to the left gives an estimator for a f + p* ■ Among 
the sites visited at least twice from which the first move is to the left (sites with 
history h = (1,0)), the proportion of those from which the second move is also 
to the left gives an estimator for a * + p} +1 ■ Using these two observations, we can 
estimate a* and j6*. 



5.2 Experiments 

We now present the three simulation experiments corresponding respectively to 
Examples HI to UlTl Note that in Example HJ the set of Equations (27j may not be 
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trivially inverted to obtain the parameter 6 = (p, a\, a-i). In particular, we were 
not able to perform (even only n umerically) the mapping inversion needed to 
compute lAdelman and Enriquea s estimator in this case. Thus, in the experi- 
ments presented below, we choos e to only consider our e stimation procedure 



in this case. The comparison with lAdelman and 



nriauez's procedure is given 



only for Examples HI and [TTT1 In those cases, while lAdelman and Enriquea s pro- 
cedure may be easily performed, we already obtain much better estimates with 
our approach. Since inverting the set of Equati ons (27ll will increase th e uncer- 
tainty of the moment estimators, we claim that lAdelman and Enriquea s proce- 
dure would do even worse in this case. 

For each of the three simulations, we a priori fix a parameter value 6* as given 
in TablefTJand repeat 1 000 times the procedure described below. We first gen- 
erate a random environment according to vq* on the set of sites {-10 4 10 4 }. 

In fact, we do not use the environment values for all the 10 4 negative sites, since 
only few of these sites are visited by the walk. However the computation cost is 
very low comparing to the rest of the estimation procedure, and the symmetry 
is convenient for programming purpose. Then, we run a random walk in this 
environment and stop it successively at the hitting times T n defined by (2} , with 
n g {10 3 k; l<k< 10}. For ea ch stop, we estimate 9* according to our procedure 
and lAdelman and Enriquea s one (except for the second simulation). In all three 
cases, the parameters in Table[T]are chosen such that the RWRE is transient and 
ballistic to the right. Note that the length of the random walk is not n but rather 
T n . This quantity varies considerably throughout the three setups and the dif- 
ferent iterations. Figure Q] shows (frequency) histograms of the hitting times T n 
for some selected values n {n = 1 000, 5 000 and 10 000), obtained from 1 000 
iterations of the procedures in each of the three different setups. 



Simulation 


Fixed parameter 


Estimated parameter 


Example[I] 


(fli,fl 2 ) = (0.4,0.7) 


p* = 0.3 


Example[ll] 




{a*,a*,p*) = (0.4,0.7,0.3) 


Example Hill 




(a* jS*) = (5, 1) 



Table 1: Parameter values for each experiment. 



Figure [2] shows the boxplots of our estimator and lAdelman and Enriquea s esti- 
mator obtained from 1 000 iterations of the procedures in the two Examples U 
andUm while Figure [3] only displays these boxplots for our estimator in Exam- 
ple HIl First, we shall notify that in order to simplify the visualisation of the re- 
sults, we removed in the boxplots corresponding to Example U (Bottom panel 
of Figure [2} about 0.8% of outliers values from our estimator, that where equal 
to 1. Indeed in those cases, the likelihood optimisation procedure did not con- 
verge, resulting in the arbitrary value p = 1. In t he same way for Examp le ITm 
we removed from the figure parameter values of lAdelman and Enriquea s esti- 
mator that were too large. It corresponds to about 0.7% of values a larger than 
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10 (for estimating a* = 5) and about 0.2% of values /3 larger than 3 (for estimat- 
ing fi* = 1). In the following discussion, we neglect these rather rare numerical 
issues. We first observe that the accuracies of the procedures increase with the 
value of n and thus the walk length T n . We also note that both procedures are 
unbiased. The main difference comes when considering the variance of each 
procedure (related to the width of the boxplot s): our procedure exhibits a much 
smaller variance than Adelman and Enrique j's on e as well as a smaller number 
of outliers. We stress that lAdelman and Enriquea s estimator is expected to ex- 
hibit its best performances in Examples HI and UTIl that are considered here. In- 
deed, in these cases, inverting the system of equations that link the parameter 
to the moments distribution is particularly simple. 
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Figure 1: Histograms of the hitting times T n obtained from 1 000 iterations in 
each of the two setups and for values n equal to 1 000 (white), 5 000 (grey) and 
10 000 (hatched). Top panel: Example^ bottom panel: ExampleHTll 
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Figure 2: Boxplots of our estimator (left and white) and lAdelman and Enriquezf s 
estimator (right and grey) obtained from 1 000 iterations and for values n rang- 
ing in {10 3 fc; 1 < k < 10} (x-axis indicates the value k). Top panel displays es- 
timation of p* in ExampleU Second and third panels display estimation of a* 
(second panel) and f)* (third panel) in ExampleUm The true values are indicated 
by horizontal lines. 
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Figure 3: Boxplots of our estimator obtained from 1 000 iterations in Example|ll] 
and for values n ranging in {10 3 k; 1 < k < 10} (x-axis indicates the value k). Es- 
timation of a* (top panel), (middle panel) and p* (bottom panel). The true 
values are indicated by horizontal lines. 
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